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Abstract. The purposes of this paper are two fold. First, we extend the method of iron- homogeneous 
harmonic analysis of Nazarov, Treil and Volberg to handle "Bergman-type" singular integral op- 
erators. The canonical example of such an operator is the Beurling transform on the unit disc. 
Second, we use the methods developed in this paper to settle the important open question about 
characterizing the Carleson measures for the Besov-Sobolev space of analytic functions on the 
complex ball of C d . In particular, we demonstrate that for any a > 0, the Carleson measures for the 
space are characterized by a "Tl Condition". The method of proof of these results is an extension 
and another application of the work originated by Nazarov, Treil and the first author. 



1. Introduction and Statements of results 

We are interested in Calderon-Zygmund operators that do not satisfy the standard estimates. 
In particular, these kernels will live in M. d but will only satisfy estimates as if they live in M. m for 
m < d. More precisely, we are interested in Calderon-Zygmund operators whose kernels satisfy 
the following estimates 

Ccz 



\k(x,y)\ < 



- y\ 

and 



\k(y,x) - k(y,x')\ + \k(x,y) - k(x',y)\ < C C z 



x'\ T 



x _ y\m+T 



provided that \x — x'\ < ^\x — y\, with some (fixed) < r < 1 and < Cqz < oo. Once the kernel 
has been defined, then we say that a L 2 (M. d ; fj,) bounded operator is a Calderon-Zygmund operator 
with kernel k if, 

T vf( x )= K x ,y)f(y)dn(y) Vx^supp/. 

Jm d 

If k is a nice function, then the integral above can be defined for all x and then it gives one of 
the Calderon-Zygmund operators with kernel k. It is for this reason, that when given a not so 
nice Calderon-Zygmund kernel people consider the sequence of "cut-off" kernels and the uniform 
boundedness of this sequence. In the applications that we have in mind for this paper, all our 
Calderon-Zygmund operators can be considered a priori bounded (for example in future arguments, 
we can always think that [i is compactly supported inside the (complex) unit ball), and we will 
be interested in the effective bound, in terms of the parameters Cqz^t and in terms of a certain 
Tl condition we explain below. Frequently, this is the better view point to adopt instead of the 
"cut-off" approach. 

For kernels that satisfy these types of estimates and for virtually arbitrary underlying measures, 
a deep theory has been developed in [13-16,25]. The essential core of this theory showed that in this 
situation, one can develop the majority of Calderon-Zygmund theory and study the boundedness of 
the associated singular integral operators via a "Tl Condition". This theory has found applications 
in Tolsa's solution of the Painleve Problem, [20-22]. The theory of non-homogeneous harmonic 
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analysis has also come to find applications in geometric measure theory in the works of Tolsa [23], 
Tolsa, Ruiz de Villa [17], Mayboroda, Volberg [11,12], Eiderman, Nazarov, and Volberg [6]. The 
main results of this paper provide yet another application of the methods of non-homogeneous 
harmonic analysis. 

Having in mind the application to Carleson measures in the complex unit ball, we wish to extend 
the theory of Nazarov, Treil and the first author [13-16,25] to the case of singular integral operators 
that arise naturally as "Bergman-type" operators. These will be operators that will satisfy the 
Calderon-Zygmund estimates from above, but we (again having in mind the above mentioned 
application, see further) additionally allow them to have the following property 

" max(d(x) m ,d(y) m ) ' 

where d(x) := dist(x, M. d \ H) and H being an open set in M. d . The examples that the reader should 
keep in mind are the Calderon-Zygmund kernels built from the following function 

1 



k(x,y) 



(1 - x ■ y) r 



where H = B d , the unit ball in Mr. These are the standard "Bergman-type" kernels that arise 
naturally when looking at complex and harmonic analysis questions on the unit disc D and more 
generally in several complex variables. When we have a kernel k that satisfies the Calderon- 
Zygmund estimates and the additional property of measuring "distance to some open set" as 
above, we will let 

T^ m (f)(x) = k(x,y)f(y)d/i(y) . 

JR d 

In applications, we will be viewing the Calderon-Zygmund kernels that arise as living on certain 
fixed sets. Accordingly, we will say that A; is a Calderon-Zygmund kernel on a closed X c R d if 
k(x, y) is defined only on X x X and the previous properties of k are satisfied whenever x, x', y £ X. 

Our first main result is the following theorem, providing a link of the work of Nazarov, Treil and 
Volberg to the context of these Bergman-type operators. 

Theorem 1 (Main Result 1). Let k(x,y) be a Calderon-Zygmund kernel of order m < d on 
X C with Calderon-Zygmund constants Ccz and r. Let fi be a probability measure with 
compact support in X and suppose that all balls such that fj>(B(x,r)) > r m lie in an open set H. 
Let also 

|fc(x ' y)l " m a x(d(x) m ,d(y) m ) ' 

where d(x) := dist(x,M d \ H). Finally, suppose that for all cubes Q a 'Tl Condition" holds for the 
operator T^ m with kernel k and for the operator T* m with kernel k(y,x): 

l|2kmXgl||3(Rd; M ) < Afi(Q) , \\T^ m XQ\\ 2 L 2 { ^ d .^ < An{Q). (1.1) 
Then WT^mW^^d.^^^d.^ < C(A,m,d,T). 

The balls for which we have fi(B(x,r)) > r m are called "non-Ahlfors balls". Non-Ahlfors balls 
are enemies, their presence make the estimate of Calderon-Zygmund operators basically impossible 
(often the boundedness of a Calderon-Zygmund operator implies that non-Ahlfors balls do not 
exist). The key hypothesis is that we can capture all the non-Ahlfors balls in some open set H. To 
mitigate against this difficulty, we will have to suppose that our Calderon-Zygmund kernels have 
an additional estimate in terms of the behavior of the distance to the complement of H (namely 
that they are Bergman-type kernels). 

The method of proof of this theorem will be to use the tools of non-homogeneous harmonic 
analysis as developed by F. Nazarov, S. Treil, and the first author in the series of papers [13-16] 
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and further explained in the book by the first author [25]. The key innovation is the use of the 
Bergman-type kernels and the ability to control them in some appropriate fashion. 

The proof of Theorem 1 arose in an attempt to characterize the Carleson measures for the 
Besov-Sobolev spaces of analytic functions on the unit ball E>2d in C d . Roughly speaking, the 
Besov-Sobolev space of analytic functions B% (^>2d) is the collection of analytic functions on the 
unit ball such the derivative of order i — a belongs to the classical Hardy space H 2 (J&2d)- An 
important question in the study of these spaces is a characterization of the measures [i for which 

/ \f(z)\ 2 d^(z) <C(p)\\f\\ 2 d) . 

These measures are typically called Carleson measures and the characterization that one seeks is 
in terms of "natural" geometric test. These ideas are explained in more detail in Section 2.2. 

For the range i < a, the characterization of the Carleson measures is a well-known simple 
geometric condition. Roughly speaking, the /i-measure of a ball should be comparable to the 
Lebesgue measure of the same ball to some appropriate power (depending on a). See for example 
[27] for the Hardy space and the Bergman space. For the range < a < \ these Carleson measures 
in the complex ball were initially characterized by Arcozzi, Rochberg and Sawyer. In [2] Arcozzi, 
Rochberg and Sawyer developed the theory of "trees" on the unit ball and then demonstrated 
that the inequality they wished to prove was related to a certain two-weight inequality on these 
trees. Once they have the characterization of the trees, they can then deduce the corresponding 
characterization for the space of analytic functions. Again, in the range < a < \ a proof more in 
the spirit of what appears in this paper was obtained by E. Tchoundja, [18, 19] . The characterization 
of Carleson measures via capacitary conditions can be found in work by Ahern and Cohn, [1]. 

An important open question in the theory of Besov-Sobolev spaces was a characterization of the 
Carleson measures in the difficult range ^ < a < |, see for example [3]. It is important to emphasize 
that key to both the approaches of Arcozzi, Rochberg, and Sawyer and that of Tchoundja was a 
certain "positivity" of a kernel. The tools of non-homogeneous harmonic analysis developed by 
Nazarov, Treil and Volberg were specifically developed to overcome the difficulty when a natural 
singular integral operator lacks positivity. It is not at all clear how (or if it is even possible) to 
adapt the methods of [2, 18, 19] to address the more general case of \ < a < ^. This leads to the 
second main result of this paper, and a new application of the theory of non-homogeneous harmonic 
analysis. In C d , let l>2d denote the open unit ball and consider the kernels given by 

K m (z,w) :=Re- ^— -, V|z| < 1, |w| < 1 . 

(1 — z ■ w) m 

Theorem 2 (Characterization of Carleson Measures for Besov-Sobolev Spaces B^^d))- Suppose 
that < a . Let /j, be a positive Borel measure in 1 S&2d- Then the following conditions are equivalent: 

(a) n is a B 2 T (M2d) -Carleson measure; 

(b) T^2a ■ L 2 (B 2d ;/i) -> L 2 (n 2 d;n) is bounded; 

(c) There is a constant C such that 

(i) 1 1 2^ )2o -XQ 1 1 |a (B 2t , ~ C ' f° T aU A " cu&es Q>' 

(ii) n(B A (x,r)) < C r 2a for all balls B A (x,r) that intersect C d \M 2 d- 

Above, the sets B A are balls measured with respect to a naturally occurring metric in the 
problem. The operator T^^a is a Bergman-type Calderon-Zygmund operator with respect to this 
metric A for which we can apply (an extended version) of Theorem 1. And, the set Q is a "cube" 
defined with respect to the metric A. See Section 2 for the exact definitions of these objects. 

We remark, that when < a < ^ this result was previously obtained in [2, 18, 19] and when | < a 
the characterization is now classical, see for example the references in [27]. The key contribution 
of this theorem is the characterization of Carleson measures in the range \ < a < ^ , the so called 
"difficult range". 
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In this paper, constants will be denoted by C throughout, and can change from line to line. 
Frequently, the parameters that the constant depend upon will be indicated. Also, we use the 
standard notation that X < Y to mean that there is an absolute constant C such that X < CY, 
and X « Y to indicate that both X < Y and Y < X. 

The authors wish to thank a skilled and thorough referee who pointed to areas where the expo- 
sition and clarity of the article could be improved. 



2. Applications of the Main Theorem: Proof of Theorem 2 

We now show how one can use the main result of this paper, Theorem 1 (and its metric version), 
to deduce several interesting corollaries. 

2.1. Necessary and Sufficient Conditions for "Bergman type" Operators. We now point 
out the situation that motivated Theorem 1. In C d , let ]B>2d denote the open unit ball and consider 
the kernels given by 

K m (z,w) := Re- * — r— , \/\z\ < 1, \w\ < 1 . 

(1 — z ■ w) m 

Let \x be a probability measure with compact support contained in the spherical layer 1/2 < \z\ < 
1 and in particular the support is strictly inside the ball. We will see that this kernel satisfies the 
hypotheses of Theorem 1 when H = B>2d> but with respect to a certain (non-euclidean) quasi-metric 
A. It will be obvious that we have the estimate 

\ Km{ - Z ^\ " max(d(z) m ,d(w) m ) ' 

In application of the Main Theorem we need the quantity d(z) which was defined with respect to 
the metric of interest. We will see that for z £ ]B>2d this non-euclidean d(z) will still be 1 — \z\: 
d(z) := dist A (^,C d \ B 2 <f) = 1 - \z\. Since if z,w € M 2d we have |1 - z ■ w\ m > (1 - \z\) m and a 
similar statement holding for w. 

We introduce the above mentioned (quasi)-metric on the spherical layer around dM2d- 



A(z,w) := \\z\ — \w\\ + 



z w 
\z\ \w\ 



1/2 < \z\ < 2, 1/2 < \w\ < 2. 



Then it is easy to see that for all z, w : \z\ < 1, |u>| < 1, we have 

1 



\K m {z,w)\ < 



A(z,w) m ' 
This holds because we know that 

< 1 

~ A(z,w) rr ' 



(1 - z ■ w) 1 

by [4,18,19]. Also in [18,19] it is proved that if A((,w) « A(z,w) then 

A(C w) 1/2 

This estimate then says that the kernel K m is a Calderon-Zygmund kernel with associated Calderon- 
Zygmund parameter r = 1/2 defined on the closed unit ball, but with respect to the quasi-metric 
A(z,w). Finally, let fi be any probability measure compactly supported in the ball M2d- 

For the Calderon-Zygmund kernel K m given above, we have the associated operator m given 

by 

T^m{f){x) := / K m (x,y)f(y)dn(y) 
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is a Calderon-Zygmund operator of order m on the closed unit ball of M d which satisfies the 
hypotheses of the Theorem 1 on the kernel K m and measure /x, however with respect to the quasi- 
metric A(x,y). If n is compactly supported in the ball, the integral above converges absolutely. 
Otherwise we will consider the approximations of ji having supports compactly contained in the 
unit ball, and we will be interested in the uniform estimates of such operators. We must now argue 
that this presents no problem in the application of Theorem 1. 

There is the following analog of Theorem 1 pertinent to new metric corresponding to A. 

Theorem 3 (Main Result 2). Let k(z,w) be a Calderon-Zygmund kernel of order m < 2d on 
X := {1/2 < \z\ < 2} C C d , with Calderon-Zygmund constants Cqz o.nd r, but with respect to the 
metric A introduced above. Let [i be a probability measure with compact support in X D M 2 d, and 
suppose that all balls in the metric A such that fj,(B^(x,r)) > r m lie in an open set H. Let 
also 

\ k ( Z > W )\ ^ (At \m A( \m\ > 

max(a(z) , d{w) m ) 

where d(z) := distA(z,C rf \ H). Finally, suppose also that a "Tl Condition" holds for the operator 
T with kernel k and for the operator T* with kernel k(w, z). Namely, for all A-cubes Q we have: 

\\T^ m XQ\\h { x^) < AfJ,(Q) , ||^ )m XQ||| 2(Rd;A() < Au(Q) . (2.1) 

Then llT^II^^)-^^) < C(A,m,d,T). 

Theorems 1 and 3 will be proved in the last section. However, now we wish to provide a few 
remarks concerning the A-cubes Q appearing in Theorem 3, and indicate why we refer to them as 
"cubes". 

In the proof of Theorem 1 and Theorem 3 the cubes considered arise from a naturally constructed 
dyadic lattice. In the standard Euclidean case, Theorem 1, we simply will take the standard dyadic 
lattice. However, in the proof of Theorem 3 we will need to transfer certain parallelepiped regions 
to a spherical neighborhood of the sphere. This situation arises since, in a neighborhood of the 
sphere, the metric A will look like a variant of the standard Euclidean metric but with different 
powers appearing. It would be geometrically "nicer" if these shapes were actual (non-Euclidean) 
balls. However, this difficulty can be overcome since one actually can replace these non-Euclidean 
cubes by non-Euclidean balls. This change between cubes and balls is not a total triviality because 
the operator does not have a positive kernel. In the Euclidean setting, we refer the reader to 
[13] for this passage from cubes to balls, and remark that these changes can be adapted to handle 
the case of a non-Euclidean metric verbatim. At the end of the paper when we prove Theorem 3 
we provide a few more words about the structure of the "cubes" Q that appear. 

The next result is a corollary of Theorem 3. We apply this corollary to obtain the characterization 
of Carleson measures in the whole scale of our spaces of analytic functions in the ball. Consider 
our special kernels 

K m (z, w) = Re 1 — - . 

(1 — w ■ z) m 

Consider a probability measure fi supported on X C l^. A well-known necessary condition for 
the space B% (B^d) to be imbedded in L 2 (B 2( ^; u) is 

M£a(C,0) <Cir 2 °, VCe<9B 2d . (2.2) 

This is easily seen by testing the embedding condition on the reproducing kernel for the space of 
functions. However, (2.2) can be rewritten in a form akin to the conditions on the measure in 
Theorem 3. Namely, of course (2.2) is equivalent to (with another constant) 

»(B A ((,r)) <C 2 r 2 °, VJ3 A (C,r)) : £ A (C, r)) n C d \ M 2d f . (2.3) 
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In turn, (2.3) can be rephrased as saying 

every metric ball such that //(-E>a(C; r )) > C2 r2a , is contained in the unit balllB^d . (2-4) 

Theorem 4 (Main Result 3). Let fi be a probability measure with compact support in X n {z £ C d : 
1/2 < \z\ < 1} and all balls B<\ in the metric A such that fi(B A (x,r)) > r m lie inside the ball M 2 d- 
Finally, suppose that for all A-cubes Q a 'Tl Condition" holds for the operator T Mm with kernel 
K m : ' 

ll r M,mXQll|2(x ;M ) < A KQ) f or all . (2.5) 
Then \\T^ m \\ L 2^ x .^ L 2 {x .^ < C(A,m,d,T). 

Proof. The proof is just a particular case of Theorem 3. We already noted that the kernels K m 
satisfy the estimate 

\K m (z,w)\ < — - , 

max(a(^J m , d{w) m ) 

where d{z) := distA(z, C d \ M 2 d)- In fact it is obvious, because with respect to the metric A (and 
not only in Euclidean metric) d(z) = 1 — \z\. 

On the other hand in [18, 19] it is shown that K m (z, w) on M 2 d is a Calderon-Zygmund operator 
with parameters m, r = 1/2 with respect to the quasi-metric A. We remark that this is a non-trivial 
observation. All together this implies that we are under the assumptions of Theorem 3, and we are 
done. □ 

Theorem 4 is, as we already noted, exactly the following theorem giving a necessary and sufficient 
condition for the "Bergman-type" operators to be bounded on L 2 (/i). 

Theorem 5. Let fi be a probability measure supported in {z € : 1/2 < \z\ < 1}. Then the 
following assertions are equivalent: 

(a) The measure fi satisfies the following conditions: 

(i) v{B A {x, r)) < Ci r m , V5 A (x, r) : B A (x, r)nC d \ M 2d + 0; 

(ii) For all A-cubes Q we have \\T^ m XQ\\ 2 L 2^ X -^) ^ C2A*(Q)- 

(b) 

\\Tfj,,m\\L 2 (X;n) < C3 < OO. 

Here C 3 = C(d, C 2 ,m,r), d = C(C 3 ,m), C 2 = C(C 3 ,m). 

We next apply this Theorem to study the Carleson measures for the Besov-Sobolev spaces of 
analytic functions on the unit ball in C d . 

2.2. Carleson Measures on Besov Sobolev Spaces of Analytic Functions. In this section 
we give a characterization of Carleson measures for the Besov-Sobolev spaces B^{W>2d)- 

Recall that the space B^(J&2d) is the collection of analytic functions on the unit ball l>2d i n C rf 
and such that for any integer m > and any < a < 00 such that m + a > ^ we have the following 
norm being finite: 

m— 1 



dV(z) 

B: 

One can show that these spaces are independent of m and are Hilbert spaces, with obvious 



^:=£|/ (J) (°)| 2 + / |(l-l^ r ^/M(,)P 



j=0 



inner products. The spaces B 2 T (M2d) are reproducing kernel Hilbert spaces with kernels given by 

kl(z) = -. — — . 2<t ■ A minor r 

AV ' (i-A-^) 

logarithmic reproducing kernel. 



k\(z) = -, — 1 . ■>„ . A minor modification has to be made when a = 0, but this introduces a 
(l-A-z) 



A non- negative measure \x supported inside IB^d is called a -B 2 (B2d)-Carleson measure if 

\f(z)\ 2 d^{z) < C(^\\ff B 2 {M2d) V/ G Bl(M 2d ). 
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This is a function theoretic property and is looking for the measures [i that ensure the continuous 
embedding of ^(B M ) c L 2 (B M ; /x) . 

The norm of the Carleson measure [i is given by the best constant possible in the above em- 
bedding. There are also geometric ways that one can characterize the (!>2d)-Carleson measures. 
These characterizations are typically given in terms of the "capacity" associated to the function 
space i?2 (®2d) an d an interaction between the geometry of certain sets arising from the reproduc- 
ing kernel k^(z). See for example [5]. However, these characterizations had the restriction of only 
working in the range < a < ^, and when ^ < a. Namely, previous methods were unable to 
answer the question in the difficult range of | < a < |. However, using the difficult methods of 
non- homogeneous harmonic analysis, we can give a characterization of the E>2 (^>2d) using the Main 
Theorem in the form of Theorem 5 (or, equivalently, Theorem 4) for all values of a at once. 

Remark 6. We can always assume that the support of the measure is compactly contained in the 
ball V>2d- In fact, in this case the embedding is always continuous, but we are interested in the norm 
of this embedding, and its effective estimate. This assumption allows us to not worry about the 
boundedness per se, which is pleasant when working with Calderon-Zygmund kernels and excludes 
questions of absolute convergence of integrals. This also explains why we needed Theorems 4 and 
5 only for compactly supported measures. 

We begin by recalling the following fact found in the paper by N. Arcozzi, R. Rochberg, and E. 
Sawyer, [2]. First, some notation since the following proposition holds in an arbitrary Hilbert space 
with a reproducing kernel. Let J be a Hilbert space of functions on a domain X with reproducing 
kernel function j x . In this context, a measure [i is Carleson exactly if the inclusion map i from J 
to L 2 (X;[i) is bounded. 

Proposition 7. A measure n is a J -Carleson measure if and only if the linear map 



is bounded on L 2 (X;fj,). 

Proof. The inclusion map l is bounded from J to L 2 (X;fi) if and only if the adjoint map L* is 
bounded from L 2 (X;/i) to J, namely, 




tffj = (ffiffij < C\\f\\ 2 LHx .^, V/ € L 2 {X-^). 



For an x € X we have 



ff(x) = (ff,j x )j 



(f, tjx)L' 2 (X;ti) 




and using this computation, we obtain that 




8 



A. VOLBERG AND B. D. WICK 



Since we know that the continuity of t* for general / is equivalent to having it for real /, without 
loss of generality, we can suppose that / is real. In that case, we can continue the estimates with 

Kf\\j= ! [ Rej w (w')f(w)f(w')d^w)dfi(w') = (TfJ) L 2 {x .^. 
Jx Jx 

But, the last quantity satisfies the required estimates exactly when the operator T is bounded. □ 
When we apply this proposition to the spaces B% (JS>2d) this suggests that we study the operator 

= J Be ( (1 _^ )2 . ) fHd^w). 

And, it is here that the tools of non-homogeneous harmonic analysis will play a role. Using Theorem 
5, we have the following characterization of Carleson measures for B% ($>2d)- The statement below 
is simply a restatement of Theorem 2. 

Theorem 8. Suppose that < a. Let fi be a positive Borel measure in f&2d- Then the following 
conditions are equivalent: 

(a) n is a B%(M2d) -Carleson measure; 

(b) T^2a ■ L 2 (M 2 d;fJ.) -> L 2 (B 2d ;/i) is bounded; 

(c) There is a constant C such that 

(i) \\T^2a XQ Hx,2 (At ) < C n{Q) for all A-cubes Q; 

(ii) fj,(B A (x,r)) < Cr 2a for all balls B A (x,r) that intersect C d \M 2 d- 

The equivalence between (a) and (b) follows simply from the abstract Proposition 7. The equiv- 
alence between (b) and (c) follows from Theorem 5 and a well-known remark that (ii) is necessary 
for (b) to hold, see e.g. [2,18,19]. 

We remark that in the range < a < ^ Theorem 2 was known. It was proved by N. Arcozzi, 
R. Rochberg, and E. Sawyer using the machinery of function spaces on trees, see [2]. It was also 
proved, more in the spirit of what appears in this paper, by E. Tchoundja, see [18,19]. He adapted 
the proof of J. Verdera of the boundedness of the Cauchy operator on L 2 (fj,), [24]. Key to both these 
approaches was the fact that when < a < i a certain "positivity" of the kernel could be exploited. 
This positivity is cruelly missing when a > ^, but can be overcome via the "Tl Conditions". We 
also remark that the range ^ < a < ^, was previously unknown, and this theorem then answers 
the question in the whole range of possible a. 

2.3. Theorem 5 and the Standard Carleson Measure Conditions. In this sub-section, we 
show that the conditions in Theorem 5 reduce to the well known geometric conditions for Carleson 
measures on the Hardy space and the Bergman space. Setting a = ^ one can show that the space 

B^ (E>2n) = H 2 (B>2n), the Hardy space of analytic functions on the unit ball B n . Similarly, if we set 

n+l 

a = , the we see that B 2 2 (B n ) = A 2 (B n ), the space of square integrable functions on B n with 
respect to Lebesgue measure. 

We further focus on the case in one dimension, d = 1, since the ideas are easier to see, though 
they readily can be adapted to several variables and could be applied to demonstrate that the 
known geometric conditions for Carleson measures in the Hardy space of the ball and Bergman 
space of the ball are equivalent to a the more exotic Tl condition appearing in Theorem 5. However, 
when the geometric condition is more difficult to understand, such as when it is characterized via 
capacity conditions, then it is not immediately clear that the geometric and the Tl conditions are 
in fact equivalent without passing through the equivalence of the main theorem. 

Finally, we will work in the case of the upper half plane C+. The main theorems apply in this 
context, either via conformal mappings and changes of variables. Key to these computations is the 
Mobius invariance of the function spaces B% (Bad). Or, one can set-up the entire problem in the 
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context of the upper-half plane, then simply apply the method of proof of the main theorems to 
obtain the geometric characterization in this context. 

2.3.1. Carleson Measures on the Hardy Space. A measure n is a Carleson measure for i7 2 (C+) (or 
H 2 (B)) if 

1 2 j , , / „\ ^- s~i f , , \2 II £ i|2 v_/ -f *— u ^ i 



c. 



As is well-known this function-theoretic condition happens if and only if the follow geometric 
condition is satisfied for all tents T(I) over JcK: 

fi (T(/)) < C\I\ VI Cl. 

First note that if I C K. is an interval, then T(I) will be a cube in M. 2 . If we know that Condition 
(ii) of Theorem 5 (in the case when d = 1 and a = |) 5 holds, then restricting the outer integral in 
that condition to T{I) and using standard estimates for the kernel one can see that 

^<C M ,(T W ) 

which upon rearrangement, yields, the geometric Carleson condition. 

Next, we want to show that if [i satisfies the geometric Carleson condition, then in fact we 
have Condition (ii) of Theorem 5 holding as well. In other words, we want to see easily why the 
usual geometric Carleson condition on fj, implies the boundedness of our operator on characteristic 
functions of squares. 

We first observe that the function 



Jr2 ? - z 



belongs to £P(C_) (since \x is supported on the upper-half plane) with norm a constant multiple 
of yj fi(Q). This is a well known fact and can be found for example in [16], however, we provide 
the proof now for convenience. Let (p be a test function on M, then we have that 

*(*) := / ^dx 

is analytic and belongs to the Hardy class -ff 2 (C_). Next, note that, 

/ <p(x)F Q ^(x)dx = [ md^O- 
Jm. Jq 

Using Holder's inequality and using that [i is a Carleson measure, and so the function theoretic 
embedding also holds, we see that for all test functions we have 



1/2 



y{x)F Q ^{x)dx = / HOMO 
Jq 

< y/iW)(f l*(0l 2 ^(0 

< Ml 2 (Rd.^VKQ)- 

Taking the supremum over all possible test functions proves the claim. However, since the 
function Fq„ belongs to // 2 (C_), then another application of the Carleson Embedding property 
(or equivalently, the geometric condition for Carleson measures) gives that 



l^iXolli^) = J \F Q ^z)\ 2 d^z) < C{^)^{Q). 
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Thus, we see that in this case, we have the equivalence between the standard Carleson Embedding 
Condition and Condition (ii) of Theorem 5. 

2.3.2. Carleson Measures on the Bergman Space. A measure /i is a Carleson measure for the 
Bergman space ^4 2 (C+) if we have that 

\f(z)\ 2 d^(z) < C(/i) 2 ||/||i 2(c+) V/ G ^ 2 (C+). 

Let Fc + denote the set of squares Q in the upper half-plane with sides parallel to the axis such 
that the boundary of Q intersects M. A well-known fact is that this function theoretic condition is 
equivalent to the following geometric condition 

n{Q)<C\Q\ VQgF c+ . 

This can be found in the paper by Hastings, [7] (after a conformal change of variable). See also 
the paper by Luecking, [10]. We want to show that the "Tl Condition" that appears in Theorem 
5 reduces to this well known geometric condition. 

Again, by standard estimates for the Calderon-Zygmund kernel in the case when d = 1, a = 1, 
we have the well-known geometric characterization of the Carleson measures for the Bergman space 

n(Q)<C(ji)\Q\ VQeF c+ . (2.6) 

We now want to demonstrate that if the standard geometric characterization (2.6) of Carleson 
measures for the Bergman spaces holds, then we have that Condition (ii) of Theorem 5 holds as 
well. Denote 



1 



Jc+ Is, - z ) 

We want to deduce from (2.6) that 



\W,z)\\l* { c + ;»)<Cn{Q) (2-7) 

for any Q € Fc . The idea behind the proof will be to decompose the two appearances of the 
measure \x in (2.7) with simpler analogous that arise from condition (2.6). These decompositions 
will introduce error terms that need to be controlled, however these errors will be positive operators 
that can be controlled via Schur's test. We now explain in detail. 

Fix Q € Fc + and decompose it to the union of rectangles q such that dist(g,R) ~ diam((7). This 
can be done in a standard fashion. Now define a function w that is constant on each q with value 
given by fi(q)/\q\. Then by the geometric definition of Carleson measures on the Bergman space 
we have that 

IMU«»(C + ) <AC(jjl), (2.8) 
where A is an absolute constant, and C{fj) is the Carleson constant appearing (2.6). Also 

fwdm 2 = n(Q). (2.9) 
Jq 

We write t^(Q,z) as the sum of integrals over all the q's. If we replace the measure \x in q by 
w drri2 in q we will get an error, which is easy to estimate, namely 

\t„(Q,z)-t wdm2 (Q,z)\<A f )^ C [ XQ(()wdm 2 {(). (2.10) 

Jc+ Is - z \ 

Denote 

l»CI 



K(C,z) :-- 



\C--z 



3 ' 
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To prove (2.7), by an application of (2.10), we are left with proving the following two estimates 



t 



w drri2 



K((,z)xQ(()wdm 2 (0 



L2(C +;M ) 



< Cfi(Q) VQeF c+ . 



(2.11) 
(2.12) 



We first focus on proving (2.11). Repeating the averaging procedure described above, we can 
replace the measure /i by wdm 2 and (2.11) reduces to proving 

\\t wdm2 (Q^)\\h(C + ; W dm 2 )<CKQ) VQGF C+ . (2.13) 

Again, performing this averaging introduces an error term which is at most proportional (up to an 
absolute constant) to 



with 



K 2 {(, z) Xq(() w dm 2 (() 



(2.14) 



Now (2.13) is obvious from (2.8) and from the fact that 1/(C — z) 2 represents the kernel of a 
C alder on- Zygmund operator on the plane. In particular, it is bounded on L 2 (C+; m 2 ), and using 
(2.8) and (2.9) we obtain 



dm 2 (Q,z)\\ L 2 (c+ . wdm2) < A\\w\\ L oo {c+) / w dm 2 < A\\w\\ Loo(c+) / wdm 2 = AC(fi) fi(Q) , 



Q 



Q 



which is exactly (2.13). 

We are left to prove (2.12) and (2.14). But, these kernels are positive, and they change by an 
absolute multiplicative constant when the argument runs over any q which we introduced in the 
averaging of the measure \i. So (2.12) is the same as the following estimate: 



K 1 ((,z) X Q(Qwdm 2 (0 



Z/ 2 (C-|_;«i> dm,^) 



<C/i(g) VQeF c+ . 



(2.15) 



We are left to prove (2.15) and its counterpart with K 2 appearing. However, one can easily see 
that these positive kernels induce bounded operators L 2 {C + \m 2 ) —> L 2 {C + ;m 2 ). 
A direct calculation shows that: 

f IQfCI . . „ 1 

/ -, ttttt, dm 2 ( z ) < A —7- . 

Jc + i^Mc-^l 3 " l^ci 1 / 2 



|5CI 1/2 
IC-^I 



dm 2 (() < A 



\Zz\ V2 



Since they are positive operators, one can conclude they are bounded operators by an application 
of Schur's Test and the computations above. So, 



K 1 ((,z)x Q (()wdm 2 (() 



<A\\w\\ L oo {c) [ w 2 dm 2 <AC(fi) 2 fi(Q), (2.16) 
Jo 



L 2 (C-|- ;w dm 2 ) 

by (2.8) and (2.9) again. But this is exactly (2.15). The same argument holds for the kernel K 2 . 
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3. The beginning of the proofs of Theorems 1,3. Littlewood-Paley 
Decompositions with respect to random lattices: Terminal and Transit Cubes 

The difficulty lies in the fact that fi is virtually arbitrary. For example, a general measure (i need 
not have the doubling property. The key idea about how to cope with the "badness" of [i is the 
use of randomness. We will decompose functions using random dyadic lattices, and then resulting 
averages will somehow be good. This will allow us to "smooth" out the difficulties of the measure 
/x. 

The proof of Theorem 1 will be divided in several parts. We begin by collecting collections of 
cubes that are beneficial for the proof of the Theorem. These collections are then used to decompose 
arbitrary functions in L 2 (M d ; \x). In the remaining sections since there is no potential for confusion, 
we will let T denote the operator T^ jm appearing in Theorem 1. 

We assume that F = supp \i lies in a cube jQ, where Q is a certain unit cube. With out loss of 
generality, consider this cube as one of the unit cubes of a standard dyadic lattice V. Furthermore, 
let T>\, T>2 be two shifted dyadic lattices, one by a small shift u\, another by L02, where Ui € = O. 
These shifts are independent and the measure P on f2 is normalized to 1. 

3.1. Terminal and transit cubes. We will call the cube Q € T>i a terminal cube if 2Q is contained 
in our open set H or fi(Q) = 0. All other cubes are called transit. Let us denote by D* erm and T>j r 
the terminal and transit cubes from We first state two obvious Lemmas. 

Lemma 9. If Q belongs to £>* erm ; then 

This follows since Q C 2Q C H and so for x, y £ Q we have that d(x) > i{Q) and similarly for 
y. Another obvious lemma: 

Lemma 10. If Q belongs to T>f~ , then 

p{Q) < C{d)l{Q) m . 

We would like to denote Qj as a dyadic cube belonging to the dyadic lattice T>j. Unfortunately, 
this makes the notation later very cumbersome. So, we will use the letter Q to denote a dyadic 
cube belonging to the lattice T>\ and the letter R to denote a dyadic cube belonging to the lattice 
T> 2 . 

From now on, we will always denote by Qj (J = 1, . . . , 2 d ) the 2 d dyadic subcubes of a cube Q 
enumerated in some "natural order". Similarly, we will always denote by Rj (j = 1, . . . , 2 d ) the 2 d 
dyadic subcubes of a cube R from T>2- 

Next, notice that there are special unit cubes Q° and R° of the dyadic lattices T>\ and T>2 
respectively. They have the property that they are both transit cubes and contain F deep inside 
them. 

3.2. The Projections A and Aq. Let T> be one of the dyadic lattices above. For a function 
ip £ L 1 (R d ; fj) and for a cube Q C R d , denote by (^)q the average value of ip over Q with respect 
to the measure fx, i.e., 

w « := mI Q i ""' 

(of course, (V')q makes sense only for cubes Q with fJ,(Q) > 0). Put 
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Clearly, Ap € L 2 (R d ; p) for all p € L 2 (M. d ; p), and A 2 = A, i.e., A is a projection. Note also, that 
actually A does not depend on the lattice T> because the average is taken over the whole support 
of the measure p regardless of the position of the cube Q° (or R°). 

Remark 11. Below we will start almost every claim by "Assume (for definiteness) that £(Q) < 
£(R). . .". Later, for ease of notation, we will write that a cube Q 6 X n y and mean that the 
dyadic cube Q has both property X and 3^ simultaneously. 



For every transit cube Q G T>\, define A n p by 



A n p\ :=0, A n p 



if Qj is transit; 



Qj nX <p — (<p)g if Qj is terminal 

(j = 1, . . . ,2 d ). Observe that for every transit cube Q, we have p(Q) > 0, so our definition makes 
sense since zero can not appear in the denominator. We repeat the same definition for R € T>i. 
We have the following easy properties of the operators Agp. 

Lemma 12. For every p € L 2 (JH d ; p) and every transit cube Q the following properties hold: 

(1) A Q peL 2 (R d ;p); 

( 2 ) jRt A Q i P d i j ' = ; 

(3) A is a projection, i.e., A 2 = A ; 

(4) A Q A = AA Q = 0; _ 

(5) If Q)Q are transit, Q ^ Q, then A^A~ = 0. 

The proof of this Lemma is given by direct computation and is left as an easy exercise for the 
reader. Using the operators A^p we have a decomposition of L 2 (M. d ; pi) functions. 

Lemma 13. For every p € L 2 (M. d ; p) we have 

p = Ap+ A Q^ 

Q£V\ r 

the series converges in L 2 (R d ;p) and, moreover, 

imi 2 =iiA^i 2 + y iiA n <^ii 2 

Qev* r 

Proof. Note first of all that if one understands the sum 

Qev\ r 

as Hindoo ^Qe£>f -e(Q)>2- k > then for //-almost every x £ M d , one has 

<p(x) = Ap(x) + ^ A Q y>(x). 

Indeed, the claim is obvious if the point x lies in some terminal cube. Suppose now that this is not 
the case. Observe that 

Ap(x)+ Yl A q^(x) = {<p) Qk , 

QeV\ r :t{Q)>2- k 
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where Q k is the dyadic cube of size 2~ k , containing x. Therefore, the claim is true if 

(v) Qk -> <Pfr) . 

But, the exceptional set for this condition has /U-measure 0. 

Now the orthogonality of all Aq^j between themselves, and their orthogonality to Aip proves the 
lemma. 

□ 

4. Good and bad functions 

In this section we introduce a decomposition of functions / € L 2 (K d ; jS) into "good" and "bad" 
parts based on random dyadic lattices. Additionally, it is shown that the proof of Theorem 1 can 
be reduced to the case of "good" functions, since the "bad" terms can be averaged out. Finally, 
we decompose the needed estimates on the "good" functions into three distinct terms that will be 
handled separately. 

We consider functions / and g € L 2 (M. d ; n). Fixing uj\,uj2 € Q we construct two dyadic lattices 
T>\ and T>2 as before. By Lemma 13 the functions / and g have decompositions given by 

f = Af+ A Q/' 9 = ^9 + E A *9- 

Q€T>[ r ReVtf 

For a dyadic cube R we denote U 2 =1 dRi by sk R, called the skeleton of R. Here the R{ are the 
dyadic children of R. 

Definition 14. Let r,m be the parameters of the Calderon-Zygmund kernel k. We fix a = 2r +2m 
and a small number 5 > 0. Fix S > 2 to be chosen later. Choose an integer r such that 

2~ r < 5 s < 2~ r+1 . (4.1) 

A cube Q € T>i is called bad (or <5-bad) if there exists R^T>2 such that 

1) t(R) > 2 r £(Q) , 

2) dist(Q,skR) < i(Q) a l{Rf- a . 

Let B\ denote the collection of all bad cubes and correspondingly let Q\ denote the collection 
of good cubes. The analogous definitions give the collection of bad cubes R £ T>2, £>2 and the 
collection of good cubes £?2- 

We say, that ip = ^QeD[ r ^Qf 1S bad if in the sum only bad Q's participate in this decomposition. 
The same applies to ip = SqgI'* 7 ' ^Q^- ^ n particular, given uj\,uj2 € ft, we fix the decomposition 
of / and g into good and bad parts: 

/ = fgood + had , where f good = Af + ^ A Q f , 

Qez>fn0i 

9 = g 9 ood + 9bad , where g good = Ag + ^ A R g . 

Theorem 15. One can choose S = S(a) in such a way that for any fixed Q 6 T>\, 

F^iQ is bad} < 5 2 . (4.2) 
By symmetry F Ul {R is bad} < 5 2 for any fixed R € T>2- 
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Proof. Consider the unit cube Q° of T>\, which satisfies F C \Q°. In particular, oj + Q° contains 
F in ^(w + Q°) for every oj € (— 1/40, 1/40] d . Recall that the normalized Lebesgue measure on 
the cube (— 1/40, l/40] d is our probability measure. In the construction of our random lattices, 
all the lattices "start" with the cube of unit side given by oj + Q°, and then oj + Q° is dyadically 
subdivided, and thus generates the dyadic lattice T>(oj). We do two such constructions, giving us 
two such arbitrary lattices Pi, T> 2 . By the definition above, a cube Q 6 T>\ is bad if there exists a 
cube R £ V 2 such that dist (Q,di2) < ^(Q) a £(i?) 1 ~ a and i(R) > 2 r £(Q). 

To choose the number S we first fix an integer k > r. Let us estimate the probability that there 
exists a cube R € V 2 of size l(R) = 2 k £(Q) such that dist(Q,dR) < l{Q) a l^R) 1 ^ . Geometric 
reasoning show it is equal to the ratio of the area of the narrow strip of size 2^ l ~ a ^ k £(Q) around 
the boundary of the cube R to the whole area of 2 k Q. The area of the strip around the boundary 
is at most C\Q\2 dk ~ ak (here C = C(d) is an absolute constant depending only on the dimension 
d). We conclude that this ratio is less than C2~ ka . Therefore, the probability that the cube Q is 
bad does not exceed 

cVr b = — — . 

i _ 2~ a 

k=r 

Since the integer r was chosen so that 2~ r < 6 s < 2~ r+1 we will then simply choose the minimal 
S = S(a) such that rz§=a < S 2 (of course, S = 3/a is enough for all small <5's). □ 

The use of Theorem 15 gives us S = S(a) in such a way that for any fixed Q € Pi, 

F^iQis bad} < 5 2 . (4.3) 

We are now ready to prove 

Theorem 16. Consider the decomposition of f from Lemma 13 and the resulting fbad arising from 
every oj 2 G £1. One can choose S = S(a) in such a way that 

n\\fbad\\ )<*ll/ll (4-4) 

The proof depends only on the property (4.3) and not on a particular definition of what it means 
to be a bad or good cube. 



Proof. By Lemma 13 (its left inequality), 



mifbadWww)) <®( £ iiaq/i i2 



L 2 (R d ;^i) 



1/2 



Q€V\ r nBi 

Then 



^(\\fbad\\L2(Rd.^) < (E £ ll A Q/llL2( R d ;At )) 

Qez>f nz?i 

Let Q be a fixed cube in T>\\ then, using (4.3), we conclude: 

EuJAq/II^^ = P W2 {Q is bad}||A Q /||2 2{R£i;/j) < 5 2 \\A Q f\\l 2{Rd ^ . 
Therefore, we can continue as follows: 

®(\\fbad\\l?(B*w)) < S ( £ \\ A Qf\\h(Ri.^) 1 < *||/|Ua(R««j M ) ■ 
The last inequality uses Lemma 13 again (its right inequality). 



□ 
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4.1. Reduction to Estimates on Good Functions. We consider uj\i^i € f2, two dyadic lattices 
T>\ and T>i corresponding to the shifts of the standard lattice by oj\ and u>2- Now take two functions 
/ and g € L 2 (M. d ; fj,) decomposed according to Lemma 13 

/ = A/+ Yl A Q/> 9 = ^9 + E 

which we further decompose into / = f goo d + fbad, and similarly g = g goo d + gbad- Let T stand for 
any operator with a bounded kernel. In the future it can be, for example, the operator with the 
kernel (rj > 0) (and then appropriately symmetrized) 

(% — w) 

k(z, w) = —j- — j > z,w eC + . 

\Z — W\* + T] A 

Then 

(Tf,g) = {T f good , g good ) + R{ wl , U2 ){f , g) , where R^ 1>UJ2 )(f, g) = (Tf bad ,g) + {T f goo d, 9bad) ■ 
Theorem 17. Let T be any operator with bounded kernel. Then 

E |-R(cJi,aj 2 )(/'5')l < 2 ^ll r llL 2 (R ti ; At )^L 2 (IR d ; At )ll/llL 2 (R d ;A t )ll5llL 2 (IR ti ;/i) ■ 

Remark 18. Notice that the estimate depends on the norm of T and not on the bound on its kernel! 

Proof. Observe that taking the good or bad part of a function are projections in L 2 (W i ; /i), and so 
they do not increase the norm. Since we have that the operator T is bounded, then 

[•fr(wi,W2)(/>flOI - \\ T \\L 2 (R d ; f j,)^L 2 (R d -^) (\\9\\L 2 (R d ; l j,)\\fbad\\L 2 (R d ^) + II / Hi 2 (M. d ;/j,) \\9bad \\ L 2 (R d ;At)) 

Therefore, upon taking expectations we find 

^\R(aj u uj 2 )(f,g)\ < \\ T \\L 2 (R d ; l j,)^L 2 {R d ; l i) ( llS'lli 2 (Rd ;M ) E ( II had II L 2 (R<V) ) + II / Hi 2 (R d ;At ) E ( Wdbad \\ L 2 {R d ;fi) ) 

Using Theorem 16 we finish the proof. 

□ 

Theorem 17 implies that we only need to obtain the following estimate: 

\{T fgood, 9 9 ood)\ <C(T,m,d,A)\\f\\ L 2 (M d. fl) \\g\\ L 2 (M d.^. (4.5) 
In fact, considering any operator T with bounded kernel we conclude 

(Tf, g) = E(Tf, g) = E(T/ 90od , gg 00 d) + ^R(^ 2 ) (/, 9) ■ 
Using Theorem 17 and (4.5) we have 

\(Tf,g)\ < C \\f\\L 2 (R d ;fi)\\9\\L 2 (R d ;tJ,) + ^\\ T \\L 2 {R d ; f i)^L 2 {R d ; f i)\\f\\L 2 {R d ; f j,)\\9\\L 2 {R d ;tJ,) ■ 

From here, taking the supremum over / and g in the unit ball of L 2 (R d ; /i), and choosing 6 = \ we 
get 

\\ T \\L 2 (R d ; l i)^L 2 (R d ; l i) - 2 C • 
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4.2. Splitting (Tf goo( t,gg 00 d) into Three Sums. Our next reduction is to show that we can only 
deal with the case when / and g are good functions and with zero average. To accomplish this, we 
will remove the term from f goo d corresponding to the term A. 

We fixwi,(j2 € £1, and the two corresponding dyadic lattices T>\ and T>2, and recall that F = 
supp \i is deep inside a unit cube Q and shifted unit cubes Q° € T>±, R° £ T>2- 

If / € L 2 (M. d ; fj,), we have by the "Tl Condition", Condition (1.1), and an application of Cauchy- 
Schwarz that 

WTAfWv^ = </> Q o||r XQ o|| i2(Rt , M) < / llt2(R ^ ( ^ ^ = A\\f\\ LHRd .^ . 

So we can replace / by / — A/. These computations of can course be repeated for g and so from 
now on we may assume that 

f(x) d[i(x) = and / g(x) d/j,(x) = . 

We skip mentioning below that Q € D\ and R £ since this will always be the case. Com- 
puting the inner product between Tf and g we see, 

(Tf,g)= Yl (A Q f,A R g)+ £ (A Q f,A R g). 

QeSi,ReS2,t(Q)<t( R ) QeQi,ReQ 2 AQ)>i(R) 
The question of convergence of the infinite sum can be avoided here, as we can think that the 
decompositions of / and g are only into a finite sum of terms (notice that the decomposition of a 
good function is also good). This removes the question of convergence and allows us to rearrange 
and group the terms in the sum in the way that is most convenient for our interests. 

Clearly, we need to estimate only the first sum above, the second will follow by symmetry. For 
the sake of notational simplicity we will skip mentioning that the cubes Q and R are good and we 
will skip mentioning £(Q) < £(R). So, from now on, 

simply means 

<2,i?:other conditions Q,R:i(Q)<i(R), QeSi, R&S2, other conditions 

Remark 19. It is convenient sometimes to think that the summation 

E 

Q,R: other conditions 

goes over good Q and all R. Formally, it does not matter, since / and g are good functions, it is 
merely the matter of adding or omitting several zeros. For the symmetric sum over Q and R with 
the property £(Q) > £(R) the roles of Q and R in this remark must be interchanged. 

The definition of <5-badness involved a large integer r, see (4.1). Using this notation, we write 
our sum over £(Q) < £(R) as follows (we suppress (Aq/, A^g) in the display below for ease of 
presentation) : 

^(A Q f,A R9 ) E + £ 

Q,R Q,R-l(Q)>2- r e{R) Q,R:i(Q)<2- r i(R) 

E + E 



+ 



},R:£(Q)>2- r £(R),dist(Q,R)<£(R) Q,R:£(Q)<2- r e(R),QnRjtd 

E + E 

- Q,Rl(Q)>2- r i(R), dist(Q, R)>£(R) Q,Rl(Q)<2- r l(R), Qni?=0 
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We then define 
o\ : 

0"3 : 



(Aq/,A^) 

\R:t{Q)>2- r l{R), dist(Q, R)<t{R) 

E + E 

Q,R:i(Q)>2~ r £{R), dist(Q, R)>£{R) Q,R:£{Q)<2- r e(R), QnR=$ 

£ (A Q f,A R g) 

>,R:l(Q)<2- T l(R), QnR^Q 



(A Q f,A R g) 



(4.6) 

(4.7) 
(4.8) 



4.3. Potential ways to estimate J Kd Jj Rd k(x,y)f(x)g(y) dfi(x) dfi(y). Recall that the kernel k (x, y) 
of T satisfies the estimate 

\Hx,y)\< tJ , L Jt 7=r. , d(x)=dist(x,R d \H), 



max(d(x) m ,d(y) m ) 



H being an open set in and 



This above inequality implies that 



\k(x,y)\ < 



Cc2 



\x - y\ 



\k(x,y)-k(x > ,y)\<C C z- 



x — X 



\X — 

provided that \x — x'\ < \\x — y|,with some (fixed) < r < 1 and < Ccz < oo. We will sometimes 
write 



K x i y)f( x )g(y) d^(x) dn(y) 



[k(x, y) - k(x ,y)]f(x)g(y) d[i(x) dfi(y) 



using the fact that our / and g will be Aq/ and Aug and so their integrals are zero. Temporarily 
call K(x,y) either k(x,y) or k(x,y) — k(xo,y). 
After that we have three logical ways to estimate 

K(x,y)f(x)g(y)dfi(x)dn(y) : 

(1) Estimate \K\ in L°°, and f,g in the L 1 the norm; 

(2) Estimate \K\ in L°° x L 1 norm, and / in L 1 norm, g in L°° norm (or maybe, do this 
symmetrically) ; 

(3) Estimate \K\ in L 1 norm, and /, g in the L°° norm. 

The third method is widely used for Calderon-Zygmund estimates on homogeneous spaces (say 
with respect to Lebesgue measure) , but this estimate is very dangerous to use for non- homogeneous 
measures. To see why, consider the following reason. After / and g are estimated in the L°° norm, 
one needs to continue these estimates to have 1? norms. There is nothing strange about this since 
typically / and g are almost proportional to characteristic functions. But, for / living on Q such 
that / = cqXq (cq is a constant), 

1 

L '\L 2 (SL d ;n) 



L°°(R d ;^) ^5 



The same reasoning applies for g on R. Then 



K(x, y)f(x)g(y) dji{x) d[i(y) 



< 



However, a non-homogeneous measure has no estimate from below and having two uncontrollable 
almost zeroes in the denominator is a very bad idea. We will never use the estimate of type (3). 
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On the other hand, estimates of type (2) are much less dangerous (although care is still required). 
Because, in this case one applies 



1 



l5llL 2 (M d ;/i) 



and gets 



K(x, y)f{x)g{y) dy,{x) d/i(y) 



<vW) 



; M )IMI.L 2 (]tt d ;/i) • 



If we choose to use estimates of type (2) only for pairs Q, R such that Q C R we are in good shape. 
This is what we will be doing when estimating 03. 
Finally, estimates of type (1) yield the following 

K(x,y)f(x)g(y)dn(x)d(i(y) < VKQ)VKR)\\f\\L*(R*w)\\9\\L*(R*w) ■ 

These present no problems from the measure \x. But, estimates of this type will only be good when 
we can exploit decay from the cubes R and Q being far apart. This will be the strategy for handling 
term a<i- 



Plan of the Proof: The first sum is the "diagonal" part of the operator, o~\. The second sum, a 2 
is the "long range interaction". The final sum, 03, is the "short range interaction". The diagonal 
part will be estimated using our Tl assumption of Theorem 1.1, for the long range interaction 
we will use the first type of estimates described above, for the short range interaction we will use 
estimates of types (1) and (2) above. But, all this will be done carefully! 

5. The Long Range Interaction: Controlling Term o- 2 

We first prove a lemma that demonstrates that for functions with supports that are far apart, 
we have some good control on the bilinear form induced by our Calderon-Zygmund operator T. 
For two dyadic cubes Q and R, we set 

D{Q, R) := £{Q) + £{R) + dist(Q, R). 

Lemma 20. Suppose that Q and R are two cubes in M. d , such that £(Q) < £(R). Let (Pq,i[) r € 
L 2 (]R d ; / u). Assume that ifg vanishes outside Q, vp R vanishes outside R; f^ d ip„dpL = and, last, 
dist(Q,sup P V R ) > £{Q) a £{R) l ~ a . Then 

1 (^cg ) I < ^ ^ g g "l^ll v 7 ^^ a^c^) 1 1 ^ 1 (Rd ; 1 1 ^ I (Md ;M) ■ 



Remark 21. Note that we require only that the support of the function ipR lies far from Q; the 
cubes Q and R themselves may intersect! This situation will really occur when estimating 0*2- 



Proof. Let be the center of the cube Q. Note that for all x € Q, y G supp^ R , we have 
\x Q -y\> ^ + dist(Q, supp z/g > ^ + 2 r ^- a h(Q) > C{r, a)£(Q) >2\x-x Q 
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Therefore, 



\(<p Qi Til> R )\ 



< Ccz- 



i jx 

i [fc(sc, y) - k(x Q , y)]ip Q (x)ip R (y) dfi(x) dfi{y) 



\<Pr. 



dist(Q,supp i/j R ) n 

There are two possible cases. 

Case 1: The cubes Q and R satisfy dist(Q, supp ip R ) > £(R). If this holds, then we have 

D(Q,R) := £{Q) + £(R) + dist(Q, R) < 3 dist(Q, supp ijj R ) 

and therefore 



dist(Q,suppV> R ) m+T 



< C 



< c 



£(Q)2£(R)2 



D(Q, R) m+T ~ D(Q, R) m+T 

Case 2: The cubes Q and R satisfy £{Q) a £{RY^ a < dist(Q, supp ip R ) < 1{R). 
Then D(Q, R) < 3£(R) and we get 



i(Q) T 



< 



£(Q)U(R)i <c £(Q)U(Ry. 



dist(Q,supp^ H ) m+T ~ [£(Q) a £(R) 1 - a ] m + T £{R) m + T ~ D{Q, R) m+T 

The computation of the equality at this point, uses the particular choice of a = 2 ( T + m ) • Now, to 
finish the proof of the lemma, it remains only to note that 

and 



Applying this lemma to (p^ = &-qf and ip R = A R g, we obtain 



\a 2 \ <CY, 



Q,R 



£{Q)i£(R) 
D(Q, R) mJr 



7V^7M^I|A Q /|| L2(Rd;At) ||A R5 || L2(Rd;At) . 



□ 



(5.1) 



If we let Tq r denote the matrix defined by 

£{Q) T 2£{R) T 2 



?> R ' D(Q,R) 7 



- VMQ) VKR) (Q e v{ r , r g vt, £{Q) < £(R) ) 



then we will show that the corresponding operator generates a bounded operator in I 2 . This then 
will allow us to control the term |cr 2 1 . Since if the operator Tq r is bounded on I 2 we have 



(R d ;/i) 

- W T Q,r\\p^P 11/ II L 2 (M d ;Ai) Il3lli 2 (» d ;/Lt)- 
The last inequality follows by Lemma 13. 



L 2 (EL d ;n) 



R 



Lemma 22. For any two families {a n } 



Qevf 



and {b R } ReT) tr of nonnegative numbers, one has 



Ew-«*>«^E<>] 4 [£'>«] 

Q,R Q R 
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Remark 23. Note that Tq r are defined for all Q and R with £(Q) < t(R) and that the conditions 
dist(Q, R) > l(Q) a l(R) l ~ a (or even the condition Q CI i? = 0) no longer appears in the summation! 

Proof. Let us "slice" the matrix T„ R according to the ratio |tjS . Namely, let 



jT if£(Q) = 2- fc ^); 



otherwise , 

(k = 0, 1, 2, . . . ). To prove the lemma, it is enough to show that for every k > 0, 

E^V*^2-i*[y>^[y>y J . 



The matrix {Tq^} has a "block" structure: the terms 6^, corresponding to the cubes R € T>2 for 
which = 2 3 can interact only with the terms corresponding to the cubes Q G P* r , for which 

^(Q) = 2 J ~ k . Thus, to get the desired inequality, it is enough to estimate each block separately, 
i.e., to demonstrate that 



E E <4] 4 [ E 



=2J- fc R:i{R)=2i 

b, 



Q,R:£(Q)=2i- k ,£(R)=2i 

Let us introduce the functions 

F(:c) := Yl ~T^= X Q^ and := Yl T^= ^r( x )- 

Q : /(Q)=2*-* V M<9) /J ; £(.R) = 2^ V 

Note that the cubes of a given size in one dyadic lattice do not intersect, and therefore at each 
point x € M. d , at most one term in the sum can be non-zero. Also observe that 



\F\ 



'l 2 (r<V) 



E 



Q 



and IIGl 



:£(Q)=2i- fc 

Then the estimate we need can be rewritten as 



[ E 



/ / 

JR d JR C 



L 2 (R d ;fj.) L ^ R 
R:t{R)=2i 



\G\ 



where 



K hk ^y)F( X )G{y)d^x)d^y) < C \\F\ l2 ^ ^ 

£(Q)ie(R)-- 



Kj,k(x,y) 



E 



O(0, -R) 



m+T 



x (x)x R (y)- 



Q,R : e(Q)=23- k ,£{R)=2i 

Again, for every pair of points x, y € M. d , only one term in the sum can be nonzero. Since \x — y\ + 
£(R) < 3D(Q, R) for any x S Q, y £ R, we obtain 



K j>k {x,y) = C2 

< C2~ T 2 k 

So, it is enough to check that 



D(Q, R) m+T 
2 jT 



7 =:C2- T 2 k k j (x,y) 



[23 + \x- y\] m+ 

k j (x,y)F(x)G(y)d f i(x)d^y) <C\\F 

Recall that we called the balls "non-Ahlfors balls" if 

fi(B(x,r)) > r m . 



WGW 

L 2 (M. d ;fi) L 2 (M. d ;fi) 
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According to the Schur test, it would suffice to prove that for every y G IR rf , one has the estimate 
f Rd kj(x,y) dfi(x) < C and vice versa (i.e., for every x G M. d , one has f Rd kj(x,y) dfi(y) < C). 
Then the norm of the integral operator with kernel kj in L 2 (]R rf ; /i) would be bounded by the same 
constant C, and the proof of Lemma 22 would be over. If we assumed a priori that the supremum 
of radii of all non-Ahlfors balls centered at y G R with £{R) = 2 jf , were less than 2P + , then the 
needed estimate would be immediate. In fact, we could write 



kj(x,y) dfj,(x) = / kj(x,y)dn{x) + \ kj(x, y) d/j,(x) 

JB(y,2J+ 1 ) JR d \B(y,2i+ 1 ) 

<2-^(y,2^))+ / * T d^x) 

JHL d \B(v,Z>+ 1 ) \ x — V\ 



\B(y,2J+ 1 ) F - U\ 

< fi+ / L-L_dt)=c. 

The problem is that we cannot guarantee that the supremum of radii of all non-Ahlfors balls 
centered at y be less than 2 J+1 for every y G M. d . Our measure may not have this uniform property. 

So, generally speaking, we are unable to show that the integral operator with kernel kj(x, y) acts 
in L 2 (M. d ; fi). But we do not need that much! We only need to check that the corresponding bilinear 
form is bounded on two given functions F and G. So, we are not interested in the points y G M rf 
for which G(y) = (or in the points x G M. d , for which F(x) = 0). But, by definition, G can be 
non-zero only on transit cubes in T>2- Here we use our convention that we omit in all sums the fact 
that Q, R are transit cubes. But they are! 

Now let us notice that if (and this is the case for all R in the sum we estimate in our lemma) 
R G T>%, then the supremum of radii of all non-Ahlfors balls centered at y G R is bounded 
by c(d)£(R) for every y G R. Indeed, this is just Lemma 10. The same reasoning shows that if 
Q G V\ r , then the supremum of radii of all non-Ahlfors balls centered at x G Q is bounded by 
2.J-fc+i < 2J+ 1 whenever F{x) ^ 0, and we are done with Lemma 22. □ 

Now, we hope, the reader will agree that the decision to declare the cubes contained in H terminal 
was a good one. As a result, the fact that the measure fi is not Ahlfors did not put us in any real 
trouble - we barely had a chance to notice this fact at all. But, it still remains to explain why we 
were so eager to have the extra condition 

on our Calderon-Zygmund kernel. The answer is found in the next two sections. 



6. Short Range Interaction and Nonhomogeneous Paraproducts: Controlling 

Term a 3 . 

Recall that the sum o~ 3 is taken over the pairs Q, R, for which £{Q) < 2~ r £(R) and Q(~\R^%. We 
would like to improve this condition and demand that Q lie "deep inside" one of the 2 d subcubes 
Rj (j = 1, . . . , 2 d ). Recall also that we defined the skeleton sk R of the cube R by 

skR := [j dRj. 
3=1 

We have declared a cube Q G T>\ bad if there exists a cube R G T>2 such that £(R) > 2 r £(Q) and 
dist(Q, skR) < £(Q) a £(Ry- a . Now, for every good cube Q G £>i, the conditions £{Q) < 2~ r £(R) 
and Q D R 7^ together imply that Q lies inside one of the 2 d children Rj of R. We will denote 
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this subcube by Rq . The sum <t 3 can now be split into 



E (V> TA ^) 



J.erm 

Q,R ■ QCR, t{Q)<2~ r 1(R),R.q is terminal 

and 

Q,R:QcR,£{Q)<2~ r £{R),R Q is transit 

6.1. Estimation of cr| erm . First of all, write (recall that Rj denotes a child of R): 



term 
°3 



E E ( A g/> rA ^). 



id 



Clearly, it is enough to estimate the inner sum for every fixed j = 1, ... ,2 . Let us do this for 
j = 1. We have 

E ( A q/> TA ^)= E E ( A g/' TA ^)- 

Q,R: £(Q)<2- r i(R),QcRieV t 2 erm R:R! eX>* erm Q:l(Q)<2- r i(R),QcRi 

Recall that the kernel A; of our operator T satisfies the estimate of Lemma 9 

1 

1{R) 

^"Z 1 (]R<V) 



\k(x,y)\ < -— for all x € Ri,y G M d . (6.1) 



Hence, 



and therefore 



TA R g(x) | < lW for all x e R u (6.2) 



II TA II < HA II V v( R l) 

HXri ^^l^rV) - 11 -R ffll £ 1 (R<V) £(i?) m 

~ ^(i?) m " R5, '^ 2 (R d ;M) ~ -R L 2 (R d ;/i) ' 

This follows because || A R g \\ Ll , Rd . < V^C^H^sll^^) and v( R i) ^ hold trivially. 

Additionally, we used the observation that, by Lemma 10 we have 

K R ) < C(d)£(R) m (6.3) 

because R (the father of the cube R\) is a transit cube if i?i is terminal. 

Now, recalling Lemma 13, and taking into account that Agf = outside Q, we get 

E l(A Q /,TA^)|= £ |(A Q /, Xfli .TA^)| 

Q-.QcRi Q.QcRi 



<C\\x Rl -TA R g\\ [ £ ||A Q /||^ 

Q-.QCR! 



So, we obtain 



L 2 (R d ;u) 



E E i( A «/^ A ^)i 

R:i?l6X>* e ™ Q:QCRi 
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^ E w a rA^J E ii a q/h 2 



L 2 (R d ;u) 

QcRi 



< c 



2 



E ii a -r 5 "l2(k^). . E E n A Q^ii j 

R: Ri£T>2 erm R:Rx&V^ rm Q:QCRi 



But the terminal cubes in T>2 do not intersect! Therefore every A^f can appear at most once in 
the last double sum, and we get the bound 



E E \(* Q f,T*A Rg )\ 
R-. Rievtf™ QcRi 

i _ ,i 



i (R L 2 (R ,/^) 



Lemma 13 has been used again in the last inequality. 
6.2. Estimation of o l { . Recall that 



°3 



E ( a q^* a ^)- 

Q,R:QcR,£(Q)<2- r £(R),R n is transit 



Split every term in the sum as 

(A Q f,TA R g) = (A Q f,T( X A R g)) + (A Q f,T*U R A R g)). 

Observe that since Q is good, Q C R, and ^(Q) < 2~ r £(R), we have 

dist(Q,suppx A fi5 ) > dist(Q,s£;#) > l(Q) a l{R) 1 ' a . 
Q 

Using Lemma 20 and taking into account that the norm ||x mp k d does not exceed 

IIA^^H^ d y we conclude that the sum 

E K A Q /' r *(w A ^))i 

Q,R:QcR,e(Q)<2- r e(R),R Q is transit Q 

can be estimated by the sum (5.1). Thus, our task is to find a good bound for the sum 

E ( A Q f^(x R A R g)). 

Q,R:QcRJ(Q)<2- r e(R),R Q is transit J 

Recalling the definition of A R g and that R n is a transit cube, we get 



where 



Xr q Ar9 Wv 



c r q = (9) R - (g)i 



is a constant. So, our sum can be rewritten as 



E c R Q ( A Q f> T *(x R ))• 

Q,R:QcR,£(Q)<2~ r £(R),R Q is transit W 

Our next goal will be to extend the function x R to the function 1 in every term. 
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Let us observe that 



(A Q /,T*( X ))=! k(x,y)A Q f(x)d t ,(x)dp(y) 



Q 

[k(x,y) - k(x n ,y)]A f(x)dfj,{x) dp{y). 



Note again that for every x € Q, y S M. d \ Rq, we have 

\x Q -y\ > £ -^ + dist(Q,R d \R Q ) > C£{Q) > 2\x - x ( 



Therefore, 

\ x ~ x n\ T f(O) 

\k(x,y) - k(x Q ,y)\ < Ccz _ Q < C cz ^ 

\ Jj q y\ \- L o y\ 

and 



Im+T ' 



Now let us consider the sequence of cubes R(j) G V 2 , be ginning with R^ = R^ and gradually 

ascending (R^ C RV +1 \ £(R^ +1 ^) = 2i(R^)) to the starting cube R° = RW of the lattice V 2 . 
Clearly, all the cubes Ry^ are transit cubes. 
We have 

d^y) = f dfijy) _ A r dfi(y) — V^j 



Pq ~ V\"^' JR°\R Q \x Q - V\"^' ^ JrU)\rU-D \x q - y\"™ — 

Note now that, since Q is good and £(Q) < 2~ r £(R) < 2~ r £(R^) for all j = 1, . . . , N, we have 
dist(Q,i?^ > dist(Q, skR^) > £{Q) a £(R^) l ~ a . 

Hence 



Ij ~ [£(Q) a £(R(3)y-a]m+T J RU) d ^ 
Recalling that a = 2(m+T) ' we see that the first factor equals 

1 

£{Q) T 2£{R(3)) m+T 2 ' 

bmce is transit, we have 



Thus, 



dp = n(R (j) ) < C£{R {j) ) r 

Rd) 



Ij < C - T 1 — r = C2-^ L 2. 



1 ~ l{Q)*l{RV)j* ~ l{Q)U(Rf. 
Summing over j > 1, we get 



r d/ijy) = y> j. < C _L 1 

M« Q I*q " ?/l m+T ^ 3 ~ 1 - 2-5 £(Q)^(i2)i ' 
Now let us note that 

< l|Aflg|| L i (flo , M) < \\A R g\\ L 2 {RQtil) 
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We finally obtain 

|(A Q /,T*( Xra>(AD ))| < 

and 



C 



1 - 2"2 



£(Q) 

£{R) 



/j(Rq) Q L 2 (R d ;fi) R L 2 (R d ;fj.) 



E 



,R:QcR,e(Q)<2- r £(R),R c .is transit 



j = l Q,R:QcRj 
We next define a linear operator on £ 2 — > ^ 2 



l^g|-K A Q /^*(W))l 



£(R) 
given by 



Q,R 



KQ) 

£{R) 



1 KQ) 



1 KQ) 
v(Rj) 



(Q c R! 



}^K 2 (R d ]f i)^R 9 K 2 (R d ^)' 



Key to the rest of this section is demonstrating that this is a bounded operator on £ 2 . 
Lemma 24. For any two families {uq} q e -ptr an d {b R } ReT)t r °f nonne 9 a ^ ve numbers, one has 

Q,R:QcRi Q R 

Proof. Let us "slice" the matrix Tq r according to the ratio ||^y . Namely, let 



Q,R 



if Q C Ri, £{Q) = 2~ k £(R); 



Q,R' 

0, otherwise 



(k = 1, 2, . . . ). It is enough to show that for every > 0, 

E^'v^-lE^ta] 1 - 



The matrix {Tq R } has a very good "block" structure: every can interact with only one b R . So, 
it is enough to estimate each block separately, i.e., to show that for every fixed R € T>2 , 



E 

:QdRi,l(Q)=2- k l{R) 



r 5* 4 /ifM a ^ ra <2-5*[£ a 2] 



'R- 



Because the cubes Q € T>\ of fixed size do not intersect, we have that 

]T M (Q) < /i^). 

Q:QcRi,£(Q)=2- k £(R) 

Reducing both parts by the non-essential factor 2~^ k b R and using the above observations we see 
that the desired estimate follows from 



E 



Q:QcRiJ(Q)=2- k £(R) 



1 KQ) 



a Q ^ 



< 



e _ 



Q:QcR!,£{Q)=2- k £(R) 



,2 I 2 

Q 
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This completes the proof. 



□ 



Remark 25. We did not use here the fact that the sequences {oq} and {6r} are supported on 
transit cubes. We actually proved the following Lemma. 

Lemma 26. The matrix {T^ R } defined by 



i(Q) 
e(R) 



(Q c Ri), 



Q,R ■ 

generates a bounded operator in I 2 . 

We just finished estimating an extra term which appeared when we extended \r to 1. So, the 

H Q 

extension of x R to the whole 1 does not cause too much harm, and we get the sum 

H Q 

E %( A Q/' Tn ) 

Q,R:QcR,i(Q)<2~ r e(R),R Q is transit 

to estimate. Note that the inner product (A^/, T*l) does not depend on R at all, so it seems to 
be a good idea to sum over R for fixed Q first. 
Recalling that 



"Rq 



(g) R -(g) 



R 



and that Kg = 
above sum, 



(g) r0 = 0, we conclude that for every Q € T>{ that really appears in the 



if, : RDQ, e(R)>2 m e(Q),R Q is transit 



"Rq 



(g) 



Rq 



Definition 27. Let R(Q) be the smallest transit cube R E T>2 containing Q and such that i(R) > 
2 r £(Q). 



So, we obtain the sum 



to take care of. 



E (9) R{Q) (A Q f,T*l 



■.e(Q)<2- r e(R) 



Remark 28. Let us recall that we had the convention that says that Q here are only good ones 
and, of course, they are only transit cubes. The range of summation should be Q € £>f*, Q is good 
(default); there exists a cube R G such that £(Q) < 2~ r £(R), Q C R and the child R^ (the 
one containing Q) of it! is transit. In other words, in fact, the sum is written formally incorrectly. 
We have to replace R(Q) by Rq in the summation. However, the smallest transit cube containing 
Q (this is R(Q)) and the smallest transit child (containing Q) of a certain subcube R of R° (this 
child is Rq) are of course the one and the same cube, unless R(Q) = R°. Thus the sum formally 
has some extra terms corresponding to R(Q) = R°, but fortunately they all are zeros! Since we 
work now with g such that Kg = (recall that Kg denotes the average of g with respect to /x), so 

(g)R(Q) = if R(Q) = R°. 
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6.3. Pseudo-i3A/0 and a Special Paraproduct. To introduce the paraproduct operator, we 
rewrite our sum as follows 

E ^ (Q) (A Q /,Tn)= £ {g ) R{Q) (f,A* Q T*l) 

<2~ r e(R) Q:£(Q)<2- r e(R) 

= (/> E (9) R{Q) A Q T*1). 

)<2-''£(i?) 



Here, we used the fact that Aq = Aq. The term that appears in the pairing with / is ubiquitous 
enough that it deserves to be singled out. 

Definition 29. Given a function F, we define the paraproduct operator with symbol F that acts 
on L 2 (M. d ; fi) functions via 

Ii F g{x):= U)R E A F(z). 

ReV 2 ,RcR° Q£V t 1 r ng 1 ,£(Q)=2- r £{R) 

As in the case of homogeneous harmonic analysis, the behavior of the paraproduct operator Up 
is dictated by the membership of F in a suitable li BMO" space. 

Definition 30. A function F € L 2 (R d ;fi) belongs to "pseudo-SMO" if there exists A > 1 such 
that for any cube Q such that fi(sQ) < K s m i(Q) m , s > 1, we have 

\F(x)-(F) Q \ 2 dfi(x)<Cfi(XQ). 

We next demonstrate that the image of 1 under the action of T* is an element of pseudo- -BMO. 
Lemma 31. Let /x, T satisfy the assumptins of Theorem 1. Then 

T*l £ pseudo-BMO. (6.5) 
Here C depends only on the constants of Theorem 1. 

Proof. For x € Q we write T*l(x) = (T*x\q)(x) + (T*Xr<i\\q)(x) =: ip(x) +ip(x). First, we notice 
that 

x,y e Q ^ \^(x) - ^(y)\ < C(K,X,t) , 
where K is the constant from our definition above. This is easy: 



\tj){x) - ip(y)\ 



/> OO r„ 

< / \k(x,t)-k(y,t)\dn(t) = J2 \k(x,t)-k(y,t)\d[i(t) 
JR d \XQ ~[ J \j +1 Q\\iQ 

1= -i wi(Q)y 



* E fJfflL K(XH(Q)r < C(K,X,r) . 



Therefore, 



But, 



(x)-(4,) Q \ 2 dv(x) <Cfi(Q) 



|^)-(^) Q |^ M (x)<4 / \T* X x Q (x)\ A dn(x)<A»(XQ) 

JQ 

by the Tl assumption, Condition (1.1), of Theorem 1. 



□ 



Lemma 32. Let \x, T satisfy the assumptions of Theorem 1. Then 

ll n T*i||L2( R d. M )_> L 2( R d ;At ) < C . (6.6) 
Here C depends only on the constants of Theorem 1. 
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Proof. Let F = T*l. In the definition of Hp since all Aq are mutually orthogonal, it is easy to see 
that 

ReT> 2 , RcR° Qev t 1 r ng l ,£(Q)=2- r e{R) 

Put 

a R '■= II ^Q F \\L a (J3L d ;tJt) ■ 

Qevfngi,i(Q)=2- r i(R) 

By the Carleson Embedding Theorem, it is enough to prove that for every S € T>2 

J2 <*R<Cn(S), (6.7) 
R€V 2 ,RCS 

or equivalently, 

]T || A Q F|| i2(Rd;/i) <Cfi(R). (6.8) 

Qev t 1 r ,e(Q)<2- r e(R),dist(Q,dR)>e(Q)°'e(R) 1 - a 

Perform a Whitney decomposition of R into disjoint cubes P, such that 1.5P C R, and 1.4P 
have only bounded multiplicity C(d) of intersection. Then, consider the sums 

sp ■= Yl II A Q- f IIl2(r<V) ■ ( 6 - 9 ) 

QeV[ r ,£{Q)<2-r£(R),QuP^<l),dist{Q,dR)>£(Q) a £(R) 1 - a 

This sp can be zero if there is no transit cubes as above intersecting it. But if sp ^ then 
necessarily 

KP) < C(d)£(P) m , 

and moreover 

(i(sP) < C(d)s m £{P) m , Va> 1. 

In fact, in this case P intersects a transit cube Q, which by elementary geometry is "smaller" 
than P, so £{Q) < C(r, d)£(P). But, then the above inequalities follow from the definition of transit. 
It is also clear that for large r and for Q and P as above 

QnP^0=*Qcl.2P. 

Therefore, 

Sp ^ =► s P < Y ||A<ji? , || £r a (R «i. At j . 

So, 

Sp ^0^ sp < [ \F(x)-{F) 1 . 2P \ 2 dfi(x)<Cfi(lAP). 

J\.2P 

The last inequality follows from Lemma 31. 

Now we add all sp's and we get that this is less than or equal to C ^ /j,(1AP). This is smaller 
than C\ [i(R) as 1.4P's have multiplicity of intersection C(d) < oo. 

□ 
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7. The Diagonal Sum: Controlling Term <ti. 
To complete the estimate of \(Tf gooc [, g g0 od)\ in only remains to estimate o\. But notice that 



ll A 0/llLi(Rd ;M ) < ||AQ/|| L 2 (R d. M )VM<9)> \\Ar9\\lI(M.1;») < \\ A R9\\L*(Mdw)VKK) ■ 

Remember that all cubes Q and R in all sums are transit cubes. In particular, in a± we have that 
Q and R are close and of almost the same size. If a son of Q, S(Q), is terminal, then by Lemma 9 



\( T XS(Q) A Qf^B.9)\ < ^^jdr " H A Q/llL 2 (R d ; At )ll A flg|lL 2 (R d ; / .) • 

So if a son of Q is terminal but Q and i? are transit, then /i(Q) < C £(Q) m ~ £(R) m . Summing 
such pairs (and symmetric ones, where a son of R is terminal) we get C l (r)||/|| L 2( ]K d ;At )|||5f|| i 2(i R[ i ;/J ). 

We are left with the part of a\, where we sum over Q and R such that their sons are transit. 
Then we use the pairing 



\(T X s(Q)A Q f,xs(R)&rg)\ < A 2 \c s{Q) I \c s[R) I Vv(S(Q))Vv(S(R)) . 
The estimate above follows from our Tl assumption in Theorem 1. Now use (6.4) to obtain 

\( T XS(Q) A Qf,XS(R)^R9)\ < C\\AQf\\L2(Rd;fi)\\ A R9\\L2(^;ti) ■ 

This completes the estimate of term o\ and thus the proof of Theorem 1. 



8. The Proof of Theorem 3 

The proof of Theorem 3 repeats verbatim that of Theorem 1 . We need only to provide ourselves 
with the family of "dyadic" lattices in metric A of cubes Q, which satisfies the following property. 

First, consider the case when d = 2. Then, we have that the metric A(x,y) is comparable 
to the following quasi-metric. Let z £ {z £ C d : 1/2 < \z\ < 2} be thought of as (\z\, A), 

and then we have A (J\z\, r|) , (|H ^y)) = IN - Ml + P( jfr » where PiCv) = I 1 ~ f ■ ^1 
is the spherical metric between points on the boundary of the unit ball. For a point on the 
sphere we can introduce local coordinates p = (r, c), where r is real and c is a complex number 
p(pi,P2) = \ r i — r 2\ + |ci — C2 1 2 - Thus, the metric A(x,y) is equivalent to the quasi-metric in M 4 , 
given by A(sc,y) = \xi - yi\ + |z 2 - V2\ + |»3 ~ 2/3| 2 + |»4 - 2/4 1 2 - 

As can be seen from the proof of Theorem 1, the main result will still hold for Calderon-Zygmund 
kernels with respect to this quasi-metric. The only issue that is not clear is the construction of 
the dyadic lattices. To obtain these, one starts with the unit cube and sub-divides two of the 
sides by a factor of 2, while the other two sides are subdivided by a factor of 4 (these correspond 
to the structure of the metric A). This can then be repeated. These are then shifted using the 
standard translations in M 4 . This then gives the dyadic lattices of interest. We then are only left 
with checking the probability statements (Theorem 15), and this is a straightforward modification 
of what appears in this paper. We sketch the details now. 

In this case, the geometry of a cube Q is governed by a parameter q, which determines two of its 
sides (the other two are then q 2 ). Similarly for R, it has a parameter r determining its geometry. 

Remark 33. Notice that A-diameters (A-size £(Q),£(R), so to speak) of Q,R are equivalent corre- 
spondingly to q 2 ,r 2 . This is because of the structure of the quasi-metric A. 

Again, one would consider the parameters of the Calderon-Zygmund kernel r and m (but with 
respect to the metric A), and set a = 2T +2m • ^ e ^ x a sman number 5 > 0, and S > 2 chosen 
momentarily, and choose an integer r such that 

2- r < 5 s < 2~ r+1 . 
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Then a cube Q is called <5-bad if there exists a cube R such that 

(i) r > 2'q; 

(ii) dist A (Q,dR) < ^ x 2 ^~ a h 

One then can prove the following Proposition 

Proposition 34. One can choose S = S{a) in such a way that for any fixed Q € Pi, 

F^iQis bad} < 5 2 . (8.1) 
By symmetry F UJl {R is bad} < 5 2 for any fixed R £ T>2- 

The key computation to observe to prove this Proposition is the following. Fix an integer k > r, 
and let us estimate the probability that there exists a cube R € T>2 of parameter t = 2 fc q such that 
dist x (Q,dR) < l(Q) a l(R) l - a = q^r^ 1 ""). Geometry shows that it is equal to the ratio of the 
(usual) volume of the narrow strip of A-distance 2 2 ( 1_Q ) fc q 2 around the boundary of the cube it! to 
the whole volume of the cube with parameter 2 fc q. 

The volume of the strip around the boundary is the sum of slabs adjoint to the faces of cube R 
of parameter r = 2 fc q. There are faces of two types: type (1) — it has (Euclidean!) measurements 
txtxt 2 , type (2)— it has (Euclidean!) measurements r x r 2 x r 2 . The slabs consist of those points, 
whose A-distance to faces is at most 

e .- q 2a r 2(l-a) _ 

For faces of type (1), the slab with this property has (Euclidean!) thickness at most C e. For faces 
of type (2), the slab will obviously have the (Euclidean!) thickness at most Ce 1 / 2 . Consequently, 
this implies the volume of the strip is at most 

C(e-r 2 -t-t + e 1/2 • t • t 2 • t 2 ) . 

Plugging r = 2 fc q and e = 2 fc ( 2-2a )q into the above we obtain that the volume of the strip is at most 

q ^2 fc ( 2 - 2a + 4 ) -|- 2 1_a+5 )q 6 

Or, it is at most Cq 6 2 6fc - Qfc (h ere C is an absolute constant), while the volume of the cube with 
parameter 2 fc q is 2 6fc q 6 . We conclude that this ratio is less than C2~ ka . Therefore, the probability 
that the cube Q is bad does not exceed 

00 Q — VOL 

CV2" to = C . 

1 _ 2-" 

k=r 

Since the integer r was chosen so that 2~ r < 5 s < 2~ r+1 we will then simply choose the minimal 
5 = S(a) such that ^J 2 - a — ^ 2 course ) S = 3/ a is enough for all small S's). This construction 
of course works when d = 2n with appropriate modifications to the geometry of the sets. 

9. Concluding Remarks 

The considerations explored in this paper can be extended to the case of metric spaces. This was 
carried out by the authors in [26]. There is also recent related work by Hytonen and Martikainen, 
[8, 9] where they prove Theorems of a similar nature. In particular the show "Tb" theorems on 
quasimetric spaces equipped with upper doubling measures. As an application of their techniques, 
they are able to obtain the same characterization of Carleson measures for the B 2 (M>2d), though, 
not our abstract Theorem 1. 

Finally, we remark that it is anticipated that the method developed in this paper, and in addition 
the tools of non-homogeneous harmonic analysis, should have other immediate applications. 
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